LEVY-SHEFFER SYSTEMS AND THE LONGSTAFF-SCHWARTZ 
ALGORITHM FOR AMERICAN OPTION PRICING 
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Abstract. Glasserman and Yu (Ann. Appl. Probab. 14, 2004, p. 2090) have 
investigated the mean square error in the LongstafT-Schwartz algorithm for 
American option pricing, assuming that the underlying process is (geomet- 
ric) Brownian motion. In this note we provide similar convergence results 
for the standard Poisson, Gamma, Pascal, and Meixner processes, pointing 
out the connection of the problem to the Levy-Sheffer systems introduced by 
Schoutens. 



1. Introduction 

The least squares Monte Carlo approach by LongstafF and Schwartz [4J has be- 
come the standard method for pricing American (or Bermudan) interest rate deriva- 
tives by Monte-Carlo simulation. It proceeds by backward induction and estimates 
value functionals by regression on a prescribed set of basis functions. Its conver- 
gence analysis was commenced in the original paper [3], and was carried out in 
detail by Clement, Lamberton, and Protter [T]. In both papers it is assumed that 
the number of basis functions is fixed, while the number of Monte Carlo paths 
grows. Glasserman and Yu [3] (abbreviated GY in what follows) have analyzed 
settings in which these two parameters increase together. In particular, they have 
shown that the number of basis functions may grow only roughly logarithmically, 
if the underlying process is Brownian motion or geometric Brownian motion. 

The present note corroborates their conjecture that their analysis should be ex- 
tendable to other settings with similar results. In this way, we add some theoretical 
evidence that a large number of basis functions is detrimental on convergence, a 
plausible fact from a practical point of view 7 . On the other hand, we provide an 
application of the neat martingale properties that Schoutens ^ found for certain 
Levy processes and families of orthogonal polynomials. 

In the following section we recall Longstaff and Schwartz's algorithm and de- 
scribe the problem that GY [3] treated. Our presentation is deliberately brief; see 
GY [3] for more on the context of the problem. Section [3] recalls the notions of 
Sheffer system and Levy-Meixner system [S]. Besides Brownian motion, this theory 
yields four processes that lend themselves to the analysis: the standard Poisson, 
Gamma, Pascal, and Meixner process. In Section U] we assume that there are only 
three exercise opportunities, resulting in a single regression, and show how fast the 
number of simulation paths must increase in order to ensure convergence of the 
Longstaff-Schwartz algorithm for a growing number of basis functions. Asymptotic 
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inversion of this result yields rows two to five of Table [TJ The last column indicates 
the maximal number of basis functions for which the mean square error tends to 
zero as the number of paths tends to infinity. Finally, Section [5] contains an anal- 
ogous bound for the multi-period setting, which is weaker, but upon inversion still 
leads to the same critical asymptotic rate as the single-period case. In the course 
of the proofs, it turns out that the different critical rate of Brownian motion stems 
from the slower growth of the connection coefficients pertaining to the associated 
Levy-Meixner system, viz. the Hermite polynomials. 



Process 


Basis polynomials 


#Basis functions 


Geometric Brownian motion 


Monomials 


ViogiV 


Standard Poisson 


Charlier 


log A^/ log log 


Gamma 


Laguerre 


log A^/ log log 


Pascal 


Meixner 


log A^/ log log A^ 


Mcixner 


Meixner-PoUaczek 


log A^/ log log A^ 


Brownian motion 


Hermite 


logA^ 



Table 1 . The highest possible number of basis functions for N paths 



2. Review of the Algorithm, Previous Results, and Notation 

For the reader's convenience, our notation closely follows that of GY |3j. Suppose 
that our (discounted) asset follows a Markov process St- The American (actually 
Bermudan) option may be exercised at the times = io < • • • < tm- The payoff from 
exercise is hn{St^), for given functions /i„, < n < m. By dynamic programming, 
the option value at time to equals max{/io(5'to), Cq (S'to)}, where the continuation 
values C* are given by 

C*^{x) = 0, 

Cl{x) = E[max{/i„+i(5t„^J,C:+i(5t„+J} | 5t„ = x], < n < m. 

Longstaff and Schwartz ^ approximate these continuation values by a linear com- 
bination of basis functions ij^nk'- 

K 

CrSx) ~ ^l3nkTpnk{x) = 0^^n{x), 
k=0 

where /3„ = (/3nOj ■ ■ • j /3ni<r)"^ is a vector of real numbers which is estimated by 
regression over the simulated paths, and ipnix) — {ipna{x), ■ ■ ■ , tprtKix))"^ ■ 

The variant of the algorithm to be analyzed proceeds as follows. Start with 
the final continuation value Cm = and the final option value Vm — ^m- For 
n = m — 1, . . . , 1 generate N sample paths {5*1^', . . . , <S'j^'^^}, 1 < « < A^, and set 

1 ^ 

i=l 

Vn = ina^{hn,Cn}- 

Finally, the initial continuation value is Ca{Sa) ~ A^~^ X^iLi ^li^il''): whence the 
initial option value Vq = max{ft,o('S'o), C'o(*S'o)} is found. 
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Note that the matrix 

has to be estimated by its sample counterpart in practice, but is available explicitly 
in the examples that GY [3] and we consider. In the single-period case m = 2, the 
question that GY |5] treated is as follows. Suppose that there is a representation 

K 

(2) h2 {St^ ) = ^ akip2k (S't2 ) , 

k=0 

with unknown constants a^. This assumption is not too restrictive; an infinite series 
representation of this kind has to be assumed anyways to get convergence of the 
algorithm, and since we are interested in K oo, we can suppose that is a 
good approximation of the payoff at ^2- Furthermore, assume that the martingale 
property 

(3) E[MSt,) \ St,] = ^ik{St,) 

holds. How fast may K tend to infinity compared to N, while assuring that the 
mean square error of /3 tends to zero? To this end, they established the bounds 

(4) sup E[|/3-/3|2] < ^X^^E[V;2,(5*jVife(^*J^] 



and 



j=0 fc=0 



K 



(5) sup E[|/3-/3|2] >—L^J2n^2K{St,rMSt,r]~^. 

1/31-1 II ill f.^Q 

Here and in what follows, | • | denotes the Euclidean vector norm, and || • || denotes 
the Euclidean (or Frobenius) matrix norm. The proofs of the estimates (H]) and (O 
are short; the bulk of the work of GY [S| lies in the concrete examples (Brownian 
motion and geometric Brownian motion), and in the general analysis of the multi- 
period case (see Section [5]). 

The martingale property ^ is convenient for estimating the expectations in 
these two bounds. Another useful property is orthogonality of the basis functions. 
If St is Brownian motion, then GY [3] have shown that for N paths the highest K for 
which the mean square error tends to zero is roughly logiV. Hermite polynomials 
are natural basis functions in this case. If the underlying process is geometric 
Brownian motion and monomials are used as basis functions, then K may only be 
as high as y/\ogN. In this note we show that the analogous rate for the Poisson, 
Gamma, Pascal, and Meixner processes is in between, viz. log A^/ log log A^. 

3. Levy-Meixner Systems 

A source of basis functions and processes that satisfy martingale equalities of the 
type ([3]) are the Levy-Meixner systems introduced by Schoutens [5]. Recall that 
Meixner [5] has determined all sets of orthogonal polynomials Qk{x) that satisfy 
Sheffer's condition 



f{z) exp{xu{z)) = ^ Qkix] 



z 



fe=0 

for some formal power series / and u with u(0) = 0, u'(0) 7^ 0, and /(O) 7^ 0. 
Schoutens introduces a time parameter t via 

00 ^ 
/(z)*exp(a;u(z)) =^Qfe(x,t) — 

fc=0 
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and shows how an infinitely divisible characteristic function, and thus a Levy pro- 
cess, can be defined by / and u under appropriate conditions. Building on Meixner's 
characterisation, five sets of orthogonal polynomials Qk{Xt, t) and associated Levy 
processes Xt are determined, which satisfy martingale equalities of the type 

nQk{Xu t) I Xs] = Qu{Xs,s), 0<s<t. 

This furnishes the connection between Sheffer (resp. Levy-Meixner) systems and 
condition There are five Levy-Meixner systems, constructed from Hermite 

polynomials, Charlier polynomials Cfc(x, /i), Laguerre polynomials L^i^\x), Meixner 
polynomials Mk{x; fi, q)^ and Meixner-PoUaczek polynomials Pk{x; ii, (^), respec- 
tively. The resulting Levy processes Xt are standard Brownian motion Bt, the 
standard Poisson process Nt, the Gamma process Gt, the Pascal process Pt, and 
the Meixner process Ht, respectively. See Schoutens |H] for the definitions of all 
these processes and families of orthogonal polynomials; we just note that the pro- 
cesses are characterized by the increment characteristic functions 

e-"V^ 

exp(e™-l), 
1 



E[exp(iu(i?f+i 


-Bt))] 


E[exp(iM(iVt+i 


-Nt))] 


E[exp(iu(G'f+i 


~Gt))] 


E[exp(i'u(Pt+i 


-Pt))] 


E[exp{m(Ht+i 


-Ht))] 



(l-iu)' 



1 - qc" 



< g < 1, 



cos(|7r — () 



cosh(2U — (2"" — C)i) 



< C < TT. 



Brownian motion is not of interest to use, since the corresponding last line of Table[T] 
has been established by GY j^i- As for the remaining four processes, in the light of 
condition the martingale relations [S] 



E 



[Ck{Nt,t)\Ns] = {^^y Ck{Ns,s), 



(6) 



(Gt 



(s-l) 



Gs] = L, 
is)k 



(Gs), 



nMk{Put,q) I Ps] = )-^Mk{Ps\s,q), 
(t)k 

E[PkiHt;t,C) I Hs]=PkiHs;s,(:), 



valid for < s < t, prompt us to choose the basis functions in Table [H (Note that 
{t)k = t{t + 1) . . . {t + k — 1) is the Pochhammer symbol.) When specializing the 



Process 


Notation 


Basis polynomials ijjnk{x) 


Parameters 


Standard Poisson 


Nt 


^nGk{x, tn) 




Gamma 


Gt 






Pascal 


Pt 


{tn)kMk{x]tn,q) 


< g < 1 


Meixner 


Ht 


Pk{x;tn,C) 


< C < TT 



Table 2. Levy-Meixner systems 
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bounds (HI) and ([5]) to our examples, we will require the orthogonality properties 

(7) I][Ck{Nt,t)CiiNt,t)]=t-''k\Ski, 

(8) E[4*)(G.)ip)(G.)] = ^^^^^^<5.., 

E[Mk{Pt;t,q)Mi{Pt;t,q)] = -^Ski, 

Wkq" 

as well as a way to express the squares of the basis functions as series of basis 
functions. We will denote by dki{tn) the connection coefficients in the expansion 

2k 

(9) ^nkjxY = y^^dki{tn)i^ni{x). 

i=0 

Where distinction is necessary, the connection coefficients corresponding to the 
four families in Table [2] will be written as d^i{tn), d'j^iitn), d^^{tn), and d^^{tn), 
respectively. The same superscripts will adorn other quantities to distinguish the 
four cases, viz. the Poisson, Gamma, Pascal, and Meixner process. Throughout the 
paper, we write c for various positive constants whose precise value is irrelevant. 

4. The Single-Period Problem 

We now state our main result about the single-period problem, where our only 
exercise times are = to < < ^2- 

Theorem 1. Suppose m ~ 2, that St is a Poisson process, and that the basis 
functions are as in line one of Tahle\^ Put {u,v) — (10,4). // the number N of 
paths and the number K of basis functions satisfy N > k'^^^^)^ for some positive 
e, then 

jim sup E[|/3-^p] = 0. 

IfN< then 



lim sup E[|/3-/3p] =oo. 

For the Gamma, Pascal, and Meixner processes, with their respective basis functions 
from Table\^ the same holds, if {u,v) is replaced by (8,8), (11,7), and (8,8), 
respectively. 

The announced critical rate log N/ log log N in Table [T] then follows from the 
fact that the solution of iV = K'^^ satisfies K ~ logiV/ loglog A^, see e.g. 
de Bruijn [5]. 

Looking at and ([5]), we begin the proof of Theorem [T] by bounding ||5'i|| and 
ll^'J-^ll, defined by ^ and Tabled] 

Lemma 2. The norms ||^'i|| and ||^'^^|| grow at most exponentially in all four 
cases (Poisson, Gamma, Pascal, and Meixner), except for < c^ and 

Proof. For the Poisson process we find, by ([7]) and Stirling's formula, 

fc=0 k=0 
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Similarly, in the Pascal case, we have 

K K 



1 1^- Kt-Dk :i 

fc=0 k=0 

The Meixner process is treated by the estimates 



and 



/c!2 

fe=0 

K L2*iJ 

fc=l i=l 
K L2tlJ 

fc=l i=l 



fc=l ^ ' fe=l 

It remains to deal with the Gamma case. The parameter i — 1 in the martingale 
property ([6]) is not quite compatible with the orthogonality relation ([8]) of the 
Laguerre polynomials. But by the formula [5] 

4"-^)(x)=i[")(.x)-Lt\(x) 

we obtain 



(10) ni^UGtMfiiGt,)] = { 



f-(T) k = i-i 

k-\-t\ \ k } 

|A:-;|>2 



hence ^I*^ is tridiagonal. As PH)) grows only polynomially in fc, it is clear that so 
does ll^'f ||. As for the inverse, note that is diagonally dominant, so that it 
suffices to bound the diagonal elements of (^ff (see Nabben [SJ Theorem 3.1]; 
note that the from that theorem are all equal to 1 in our situation.) The diagonal 
elements of (^'f )^^ can be computed recursively as [6] 

'^^ = 1cTtX K-i ) 



and 



efc-i,fc-i = — — [ k + t ^'"''^ ~ ' ' I < k < K. 



A straightforward backward induction shows that this implies 

\ekk\<i'iiti + l)f-^+'eKK, 0<k<K, 
hence IK^f)^^!! grows at most exponentially, too. □ 
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We proceed to bound the fourth order moments m Q. Using Q and the mar- 
tingale relation we obtain 

2j 2fc 

i=0 s=0 

1=0 8=0 

2j 2k 

(11) = EE^J*(^2)dfe.(ti)E[V;H(5tjVi.(^tJ]. 

1=0 s=0 

The connection coefficients from the expansion (jOj are well-studied objects for vari- 
ous families of orthogonal polynomials. See Zeng [TD] for some combinatorial prop- 
erties and explicit formulas. Paraphrasing one of these results [TUl Corollary 2], we 
have 

(12) dUtn)=C-'k\'^lY.■ 



s>0 



(13) d«(t„) = 22'=+^fcPz!^ 



(s - k)P{s - iy.{2k + i ~ 2sy.' 

{tn - l)s 



(14) dint,,) = {1 + qr+^kl'^l^^ ^ "^"'""^ 



(15) df,{t,,) = {-2 cot Cr'+'klHlY, 



fb^ (s-A;)!2(s-z)!(2fc + i-2s)!' 
(i„).(l + (cotC)-')^ 



4'*(s - fc)!2(s - i)!(2fc -I- i - 2s)!' 

Here it is understood that 1/n! — for n a negative integer, as is natural when 
extending the factorial by the Gamma function. Therefore, the sums in (fT^ - ((T5|) 
run from s = max{i, fc} to s = fc + [i/2j . 

Moment bounds in the Poisson case. By ([7]), (fTTjl . and (fT2|) . the sum on 
the right-hand side of ^ can be estimated by 
(16) 

K K K K 2min{fc,i} 

EEe[^.'.(^*^w.(^*.)']^^"EE E AY.^%]\Y.^l 

j=0 fc=0 j=0 fc=0 1=0 \s>0 y \s>0 

where 



(s-fc)!2(s-i)!(2fc + 2-2s)!' 

It is easy to see that h^j^^ ^ k+i+il^li k+i ^ ^ ^'^^ ^ ^ ^ — ^ — */2, and k > i — I, 
hence b^ - ^.^j increases in k under these conditions. From this we deduce that the 
s-sums in (1161) increase in j respectively k: 

E ^^^'^ ^ E ^k,i,k+l 

s=inax{i,A:} ^=max{'t — fc,0} 

l^/2\ 

— E ^k+l,i,k+l+l 

/=max{i — fc,0} 

fc+Li/2j+l /C+LV2J+1 

= E ^k+l,i,s — E ^fc+l,i,s- 

s=max{i,fc} + l s=max{i,fc+l} 
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Using this in ((TH]) yields (recall that c may change its value in each occurrence) 

K K 

(17) ^^E[v^p(^*j'V'r.(5.j'] 

j=0 k=0 

2K /K+lt/2\ 



y is-K)\^{s-iy.{2K + i-2sy. 

It is easy to see that the sunimand increases in i for K>0, Q<i<K and 
K < s < K + i/2. Hence we find that the portion X^iLo '^^ i-sum in (fTT)) can 
be bounded from above by 



/[3K/2] \ ^ 



The last equality holds because the summand in ()18p is unimodal with mode at 
s = K + K"^/^ — \K^^^ + 0(1). The remaining part of the i-sum in (fT7)) can be 
estimated by 



2K K+li/2\ 

1 1 .s I 



V (s-X)!2(s-z)!(2X + z-2s)! 

J i\{K+\i/2\)\ 

i=K+l 
2K 

~ ,=r+i(^+LV2j-*)!2 - ^ 



(19) E * 



U/2j!2(i^+ [i/2j -i)!(z-2[i/2j)! 
z!(K+ U/2J)P 



Note that in the first line we have introduced the new factor s! in the numerator. 
This makes the summand increasing w.r.t. the substituion i^« + l,s— ^s + l. 
Hence it suffices to keep only the summands of the s-sum with s = K + [j/2j, 
which shows the first inequality. As for the second inequality, note that the factor 
« ! / [i/2j ! 2 of the summand grows only exponentially, and that the factor ( i — 2 [i/2j ) ! 
in the denominator is clearly negligible. Finally, the last sum in ()19p has increas- 
ing summands, which together with Stirling's formula implies the last inequality. 
By (ini), the estimates (HSJ) and ([TH) show that 

X:EE[^r,(^*.M.(^.j^]<c^i^^''^. 

j=0 k=Q 

In the light of (jl]) and Lemma [H the value u = 10 for the Poisson Process in 
Theorem [1] is established. 

As for the second assertion about the Poisson process in Theorem [U note that, 
from mi]), 

2K 2k 

E[V'2K(5tjVife(5tj'] = ^^dK.(i2)dfcs(ii)E[V'H(5tJ^i.(5*J]. 

i=0 s=0 
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The orthogonality property ([7]) and formula p2p yield 

K K 2k 

fc=0 k=0 2=0 

> c^(2if)!3 > c^K^if 

The second inequality follows from retaining only the summand k — K, i — 2K . 
This makes the sum in collapse to the summand s — 2K, whence the third 
inequality. Appealing to ([5]) and Lemma [5] completes the proof of the Poisson part 
of Theorem [TJ Note that the preceding estimates can presumably be improved. 
This seems not worthwile, though; since our estimate of H^*! || in Lemma[2]is sharp, 
we will not obtain equal values it = f in Theorem [1] anyways, unless at least one of 
the bounds ^ and ((5]) were improved, too. 

Moment bounds in the Meixner case. The proofs in the remaining three 
cases are very similar to the Poisson case. In the Meixner case, we have 
(20) 

K K K K 2miii{fcj} / \ / 

j=0 fc=0 j=0 fc=0 i=0 \s>0 / \s>0 

where 

iM ^- ^■ 



''''' {s-ky?{s-i)\{2k + i-2s)V 

Again, increases in k, and the remaining steps to show the upper bound 

are completely analogous to the Poisson case. This time the numerator factor s! 
in the analogue of appears naturally, and is not introduced artificially to force 
some monotonicity. Moreover, the lower bound uses the same summands as in the 
Poisson case. Both resulting bounds are of the form K^^ , whence m = w = 8 in 
Theorem m 

Moment bounds in the Pascal case. We can reuse the values and the 
estimate that we just sketched: 

K K K K 2min{fe,i} 

j=0 fe=0 i=0 k=0 i=0 

K K 2min{fe,i} 

<c-KM2K)lY^Y. E 

The lower bound poses no new difficulties either. 

Moment bounds in the Gamma case. This part is only slightly more in- 
volved. Due to (fTO|) . we have three i-sums instead of one in the analogue of (fT7|) . 
The right-hand side of pil)l can be replaced by in each of these. Then one 
of the three i-sums equals the i-sum in (1201) . and the other two differ only in an 
index shift b^^j^-^ ^, which can be easily bounded by polynomial factors. Thus the 
resulting growth rate is K^^ , as for the Meixner case. The proof of Theorem [1] 
is complete. 

5. The Multi-Period Problem 

The following result shows that the critical growth rate log N/ log log N from 
Table [T] carries over to the multi-period setting. We assume that a representation 
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analogous to ^ holds at time tm , and that the payoff function does not grow too 
fast in the following sense. 

Theorem 3. Suppose that the payoff functions satisfy the growth constraint 

ft 

E[hn{St )^] < max — maxE[V'^fc(S't 0<n<m. 

v \ ti, J u,k 

Then 

sup E[|/3„-/3„p] < Ar-ic^x(™-"+i)"^, 1 < n < m, 

|/3™-i| = l 

where u takes on the same values as in Theorem]^ that is, 10, 8, 11, 8 for St the 
standard Poisson, Gamma, Pascal, and Meixner process, respectively. 

Proof. By results of GY 3J (Theorem 3 and the last formula before (18) on p. 2096) 
and Jensen's inequality, we have 

sup E[|/3„-/3„|2] 

|/3,„_i| = l 

<^ max ||*;i||3maxE[V^,fc(5tj4]m-nj^^^j,[^^^(^^j2]2 
<^ max ||vI/;i||3maxE[V'.fc(5*J^]"^"+^ 

iV l<i/<m i/.k 

Note that GY [3] assume that the moments E[iljnk{St^)'^] and Ei[tpnk{St^)'^] are 
increasing in n and fc, and formulate their Theorem 3 with E[7/)^^] instead of 
maxjy^fe E[i/)^^'*'']. But an inspection of their proof quickly shows that taking the 
max in the above estimate gets rid of the monotonicity assumption. Now note 
that II^E'^^ll < in all our four cases by Lemma [2 and that 

K K 

maxE[V'.fe(^tJ^] < max V VE[V'.j(5tj2^,,(5^j2] < j^uK ^ 

v,k V — ' ^ — ' 

where the double sum has been estimated in the proof of Theorem [T] □ 
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